DiscoverLessWrong (30+ Karma)“Why Corrigibility is Hard, and Important [IABED Resources]” by Raemon
“Why Corrigibility is Hard, and Important [IABED Resources]” by Raemon

“Why Corrigibility is Hard, and Important [IABED Resources]” by Raemon

Update: 2025-09-30
Share

Description

I worked a bunch on the website for If Anyone Builds Its Online Resources. It went through a lot of revisions in the weeks before launch.

There was a particular paragraphs I found important, which I now can't find a link to, and I'm not sure if they got deleted in an edit pass or if they just moved around somewhere I'm failing to search for.

It came after a discussion of corrigibility, and how MIRI made a pretty concerted attempt at solving it, which involved bringing in some quite smart people and talking to people who thought it was obviously "not that hard" to specify a corrigible mind in a toy environment.

The paragraph went (something like, paraphrased from memory):

The technical intuitions we gained from this process, is the real reason for our particularly strong confidence in this problem being hard."

This seemed like a pretty [...]

---

Outline:

(03:21 ) Intelligent (Usually) Implies Incorrigible

(10:42 ) Shutdown Buttons and Corrigibility

(23:42 ) A Closer Look at Before and After

---


First published:

September 30th, 2025



Source:

https://www.lesswrong.com/posts/ksfjZJu3BFEfM6hHE/why-corrigibility-is-hard-and-important-iabed-resources


---


Narrated by TYPE III AUDIO.

Comments 
In Channel
loading
00:00
00:00
x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

“Why Corrigibility is Hard, and Important [IABED Resources]” by Raemon

“Why Corrigibility is Hard, and Important [IABED Resources]” by Raemon